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Abstract 

The "dumpiness" matrix of a network is used to develop a method to identify its commu- 
nity structure. A "projection space" is constructed from the eigenvectors of the dumpiness 
matrix and a border hne is defined using some kind of angular distance in this space. The 
community structure of the network is identified using this borderline and/or hierarchical 
clustering methods. The performance of our algorithm is tested on some computer-generated 
and real-world networks. The accuracy of the results is checked using normalized mutual 
information. The effect of community size heterogeneity on the accuracy of the method is 
also discussed. 
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1 Introduction 

Discovering the structure of complex networks and understanding the underlying laws that lead to 
such structures is probably one of the most important problems in complex network theory [1]. In 
the smallest scale (nodes) the topology of a network may be described by its degree distribution 
and in the largest scale global properties such as diameter of the network, mean degree, clustering 
coefficient, and shortest mean path are significant. In a middle scale, properties such as community 
structure are important. In recent years there have been many efforts to define and find the 
community structure of complex networks [2, 3, 4, 5]. Spectral algorithms are of most celebrated 
community detection methods. These algorithms usually use the eigenvalues and eigenvectors of 
a matrix corresponding to the network to find its community structure. The effectiveness of a 
method turns out to depend on the structure of the network itself. For example some methods 
give accurate results for networks with many loops, but are not good for tree like networks [3]. 

In this paper we use the recently introduced "dumpiness matrix" of a network [6] to identify 
its community structure. The dumpiness measure was introduced to give more information about 
the structure of networks and can be used as a measure to classify them. Roughly speaking the 
dumpiness measure shows how high degree nodes are close to each other. It turns out that the 
dumpiness matrix is useful in community detection of a network. 

The organization of the paper is as follows. In section 2 we introduce our method of community 
detection using the dumpiness matrix and give a qualitative justification for the effectiveness of the 
method. In sections 3 the method is applied to some computer-generated and real-world networks 
with two and more than two communities. Then an improvement on the algorithm is suggested 
and the performance of the method is tested. Section 4 is devoted to summary and concluding 
remarks. 

2 Community detection using the dumpiness matrix 

In this section we develop our algorithm for detecting communities using the dumpiness matrix 
and discuss qualitatively how the dumpiness eigenvectors can help to identify the community 
structure of a network. 
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2.1 dumpiness Matrix 



dumpiness matrix(S) was initially introduced by Estrada et al [6] to extract parameters for 
classifying complex networks. For a network with n nodes this is an n x n matrix, with elements 




where ki is the degree of node i and dij is the magnitude of the shortest path between nodes i 
and j. Let Vi, V2, Vn denote the eigenvectors corresponding to the (non-increasing) sequence 
of eigenvalues £1, £2, of the dumpiness matrix respectively. Since the elements of S are 

non-negative; elements of Vi are all of the same sign according to the Perron-Frobenius theorem. 
Therefore, the elements of V2 can be positive or negative in order to satisfy orthogonality condition. 
Element Vi{q) of Vi = (f j(l), f j(2), f w j(n)) corresponds to the node q of the network. 

In networks with two communities, observations show that nodes with positive elements of V2 
belong to the same community. This is also the case for negative elements. However, the border 
between these two communities is not exactly zero. In other words in some cases a node with a 
negative element belongs to the community of nodes with positive elements or vice versa. Hence, 
in order to detect communities, we need a more strict method rather than merely considering the 
sign of elements of ViS. This, and its generalization to networks with more than two communities 
is explained in next subsections. 



2.2 Networks with two communities 

For a network with two communities, we deem a two dimensional space (the so called projection 
space). The node i of the network is shown in this space by (x,?/) = (f i(i), f2(i)). A similar 
projection space was previously employed by Donetti and Munoz using the Laplacian matrix 
to unfold the community structure of complex networks [7] ^. We use the same notion for the 
dumpiness matrix. Projecting all nodes of the network in this space, it is observed that usually 
they are accumulated in two branches (see FIG. 1). Each group of points in this space is supposed 
to indicate one community. Therefore, one should show a way to divide the points into two groups. 
In this plane each node is alternatively identified by its polar coordinate (r, ^), where r = + y'^ 
and 9 = aictaji{y / x) . Our observations show that the component 6 of each node is appropriate to 
identify its community. In other words, the 6 components of nodes of each community are close 
to each other. Hence, we need a borderline with angle 0, to separate the two branches. We take 
into account four possibilities to define G : 

eMA = ^(C"" + r*"), (2) 

^For more information about the spectral clustering and properties of projection space interested reader can 
study Ref. [8]. 
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^MH — 2(^2/,'"""' ~^ Oyrain) , (3) 

QWA = , (4) 



,,rmn 



where 0'^°-^ and are maximum and minimum angles of points respectively, and Oymax and 9yr, 
are angles of points with maximum and minimum ordinates respectively. The quantity 6^1^ is the 
average angle and Qma is the maximum-minimum average. Observations show that using some 
combination of angles and ordinates of points to define the borderline leads to better results in 
community detection. This is the motivation to define borderlines Qmh and Qwa- Eq- (4) is a 
weighted averaging on angles of points, where weights are their ordinates, and it turns out that this 
gives better results than other definitions. An example of a simple graph having two communities 
is shown in FIG. 1. 



2.3 Networks with more communities 

In networks with more than two communities, the use of eigenvectors' relation is somehow different. 
The angle associated to each point i, which is arctan(^^), separates one community from another 
when there are two of them to be partitioned. This inspires us to assume that the ratio of next 
eigenvectors to Vi (eigenvector with greatest eigenvalue), can be used to find more communities. 
Suppose that the network is constructed from C communities, we use the first C eigenvectors of 
the dumpiness matrix to construct an n x (C — 1) matrix, F, in which the column j — 1 includes 
angles obtained from arctan(^^). That is: 

r,,,„i = arctan (^^) , J = 2, C. (5) 

Depicting points in a C — 1 dimensional projection space constructed from C — 1 columns of F, 
the points related to each community are accumulated in distinct clusters. This aggregation of 
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depicted points has also a physical justification which is explained in subsection 2.4. In fact the 
angle Tij^i gives the direction of point i with respect to a reference direction (i.e. direction of 
Vi(z)) in each Vi — Vj plane. Now a better result could be achieved if we calculate the angles 
between every two points without using such a reference and find accumulation of nodes with the 
same direction. In this case, we should use a C-dimensional space constructed of rows of matrix 
U: 

u„ = Vj(2), j = i,...,a (6) 

In order to partition the resulted clusters we used agglomerative hierarchical clustering algo- 
rithm [9, 1]. This algorithm starts with each point as a separate cluster and then progressively 
merge the points into bigger and bigger groups. Different methods such as simple-linkage, complete- 
linkage and average- linkage could be used to determine when two groups should be merged together. 
Here we used the average-linkage method as it led to better results. The rest of the procedure 
depends on choosing T or U. 

In the first case the Euclidean distance between points in the projection space should be derived 
from matrix T. Using the yielded distances, we build a cluster tree according to points' average 
proximity. Then, the C clusters associated with communities can be derived. In case of considering 
the other matrix for projection space, we are supposed to treat points as vectors and use angular 
distances instead. 

Note that it is feasible to apply this method to the case of networks with two communities too. 
However our observations show that defining the borderlines leads to a better performance in that 
case. On the other hand, in the higher dimensions of projection space it is not possible to define 
such a borderline or generalize it without a considerable loss in performance and accuracy of the 
method. Moreover, the use of this method in two dimensions helps to understand how the method 
works. 

It is worth mentioning that we need the number of communities to proceed with our algo- 
rithm. In some cases, a priori knowledge about this number exists. Otherwise we should obtain it 
somehow. A possible way (inspired by Donetti and Mufioz [7] and also used in [3]) is to calculate 
modularity [10] while enlarging the dimension of the projection space i.e. using two and then more 
eigenvectors. One can also use the gap between eigenvalues of directed Laplacian matrix [3]. This 
can be done rather fast by Lanczos algorithm [11] since a finite number of eigenvalues should be 
obtained. In addition, there are algorithms devised for this common problem [12, 13]. In our tests 
we do not take account of this problem and use the given number of communities. 

2.4 Qualitative analysis 

The outcome of this method can be explained by looking into eigenvectors and their mutual 
relations. Let speculate the network is consisted of nodes that interact with each other through 
hopping of some entities. Then one can correspond the following Hamiltonian to the network: 

H = J29^A\^){j\ + + , (7) 

id i 

where Wi acts like a localized potential which is considered to be zero in this paper for simplicity. 
It is rational to consider gij proportional to — That is, hopping between sites i and j is 
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Figure 2: Obtaining the inverse tangent of some data is like mapping them from one infinite space 
to a finite one. 



proportional to number of exiting ways from node i multiplied by number of entries to node j, and 
inversely proportional to square of nodes' distance. Hence, dumpiness matrix can be viewed as 
the negative of the Hamiltonian of the system. This Hamiltonian describes a system in which each 
site (corresponding to a node of the network) interacts with all other sites. Therefore, Vi would be 
the ground state of the system, which is also the most ordered state, i.e. all components of Vi are 
positive. By increasing the energy of the system its initial order diminishes. The first exited state, 
V2, encompasses two regions of opposite spin-like states. The regions are expected to coincide with 
communities just in case of a network with two communities, since there are more and stronger 
interactions among the members in each group than interactions between nodes in different groups. 
For networks with more than two communities one should take into account more eigenvectors (i.e. 
more excited states). When the system gradually advances to higher states, interactions make sites 
in each community to behave alike each other. In the Cth level, short range order is held in C 
distinct regions. 

These regions are more distinguishable if we compare their order with the order of the ground 
state. This is best achieved taking into account the ratio of eigenvector components of each level 
to those of the ground state (Vj/Vi). By taking the inverse tangent of these ratios, we restrict the 
maximum values, making them closer together (FIG. 2). Accordingly, columns of matrix T tell us 
how sites have changed with respect to others in each level. As a result, clusters of points in C — 1 
dimensional space indicate the most related nodes, defining the optimum borders of regions. 

For networks with heterogeneous community size distribution the situation may differ. In a 
very small community, nodes have lower degrees (i.e. fewer connections) with respect to nodes 
in a large community. For small mixing between communities, nodes in the small community 
are more connected to the peer nodes with lower degrees, while nodes in the big community 
have many connections with high degree nodes. Hence, nodes in small communities have low 
valued components in corresponding eigenvectors, i.e. they are less occupied, with respect to 
others in large communities which could have a broad range of component values in corresponding 
eigenvectors. As a result, the branch related to a small community will become much shorter 
and its points accumulate near the origin so that they all mix with the points related to the 
large communities. On the other hand, for large mixing between communities, more connections 
between nodes of different communities will adjust the heterogeneity and the size effect will be 
reduced. FIG. 3 shows examples of this situation. 

The networks in FIG. 3 are constructed using an algorithm inspired from LFR benchmark [14]. 
This algorithm generates networks with power-law degree distribution and two communities in 
which mixing parameter and the sizes of communities are arbitrary. The maximum and average 
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Figure 3: Samples of different situations on several realizations of a benchmark network with two 
communities. In (a)-(c) the community sizes are 50 and 250, and in (d) both communities have 
150 nodes. 



mixing parameter (//) is indicated in each case. FIG. 3a and FIG. 3b are samples of the projection 
space (U) for two different situations happened in a number of realizations of a network having 
communities with 50 and 250 nodes and low mixing parameter between them. The degree distri- 
bution power is —2 and minimum and maximum degrees are 20 and 100 respectively. In FIG. 3a 
the points are well separated in two distinct branches. On the other hand, in FIG. 3b the bigger 
branch is extended while points of the small community are accumulated near the origin. In 50 
realizations of the network 8 of them was related to this situation. For a larger mixing parameter 
(FIG. 3c) this situation is less probable (1 out of 50). The two communities in FIG. 3d have the 
same size. This, results in perfect formation of two branches even for small mixing parameter in 
all 50 realizations. 

One can also use a similar argument about the performance of Laplacian matrix on detecting 
the communities. In this regard, there is a difference between behaviours of Laplacian and dumpi- 
ness matrices. This difference results from the type of interaction that each matrix represents. 
The dumpiness matrix assumes a global interaction between the nodes, on the other hand the 
Laplacian matrix represents interactions between the neighbors plus a potential well in each site 
(i.e. the diagonal elements). Because of the tight-binding interactions in the Laplacian matrix, 
the occupation of each network's site (i.e. element of an eigenvector) only depends on its adjacent 
sites. In small mixing parameters this leads to good separation of communities. However, when 
the number of outer and intra links are comparable, the elements of eigenvectors will be very close 
and mixed together so that dividing the clusters is hard. 

On the other hand, due to the infinite range of interactions in dumpiness matrix, each site is 
occupied according to its interaction with all the network. The nodes with more connections and 
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stronger interactions are more occupied. Hence, these nodes are placed at the end of the branches 
corresponding to their communities in the projection space. By increasing the mixing parameter 
the branches approach each other; this approach is more appreciable near the origin than at the 
end of the branches. Therefore, in larger mixing parameters the nodes which are far from the 
origin can be well separated. Additionally, the nodes which are not close to boundaries of two 
branches are still separable. This is in contrast with the Laplacian case in which the difference 
between different sites is not appreciable. This results in better performance of dumpiness matrix 
in large mixing parameters. 



2.5 Time complexity of the method 

The running time of our algorithm depends on the size of the system. Below we briefly discuss 
this dependence for different parts of the algorithm. 

In order to obtain the dumpiness matrix one needs to calculate all shortest paths of the 
network. Two popular algorithms for this purpose are Floyd- Warshall with 0{n^) time complexity 
and Johnsons with 0{'n?log{m)) complexity, with n and m being the number of nodes and links 
respectively [15, 16]. The average- linkage hierarchical clustering method which is used in our 
algorithm has a time complexity of 0{Tn?log{n)) [17]. The complete-linkage method used by Donetti 
et al [7] has the same complexity, but according to our observations average- linkage leads to 
better results. Finally we need to obtain a finite number of dumpiness matrices' eigenvectors 
corresponding to its largest eigenvalues. This can be done by algorithms based on Lanczos method 
[11] which their running time have a 0{'n?) dependency on the size of the matrix [18]. The 
dependency of this method on the number of iterations and eigenvectors is negligible as they are 
finite. Hence, the time complexity of our algorithm is 0{'n?log{m)). 



Normalized Mutual Information 

Normalized mutual information (/(A, S))[19], which provides a good inspection on the discrimina- 
tion power of detection methods, is based on intersection matrix N. The entry Nj^ of N, equals to 
the number of nodes in detected community j common to real community i. In a network with the 
number of built in communities c^, let cb be the number of detected communities. Then /(A, B) 
is defined as below: 

... ^ -2E-^iEgiAr,.ln(iV,.iV/iV..iV,) 

^ ' ^ E^^i N.. HNJN) + 1 N, HN,/N) ' ^ ^ 

where is the total number of nodes, iVj, the sum over the row i and N,j the sum over column 
j of N. In our method we suppose that the number of communities are given initially, therefore 
ca and cb are equal. The normalized mutual information is a number between zero and one, and 
greater values of /(A, B) mean the better community detection. 



3 Computational results 

In this section we apply our community detection method on some artificial and real-world net- 
works. 
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Figure 4: (a) An artificial graph with two communities, (b) Projection space 

3.1 Networks with two communities 

3.1.1 Artificial networks 

Our first example (FIG. 4a) is a network with two communities having 30 and 20 nodes respectively. 
The connection probability between the members of each community is 0.5 and for those in different 
group is 0.1. FIG. 4b shows the projection space of this network with the borderlines. FIG. 4b 
illustrates that the borderline WA identifies the communities more accurately. Borderlines MH 
and MA have one misidentification, while AA has 2 misidentification. 

By applying our method on benchmark networks [14], its accuracy will be observed more 
precisely. On benchmark networks, we can analyze the boundary of accuracy by calculating the 
variation of normalized mutual information with respect to different network parameters. In FIG. 5 
the variation of normalized mutual information with mixing parameter fi (the ratio of inter to intra 
links) for networks with two communities is depicted. The benchmark networks have 100 nodes 
and the initial parameters are defined as follows: degree distribution exponent 7 = 3, maximum 
degree K^^ax = 50, minimum community size Smin = 30 and maximum community size Smax = 60. 
In this figure we can see the average I{A, B) for WA and AA definitions. The two other definitions 
have /(A, B) between WA and AA lines. We see that the discrimination power decreases with the 
increase of yU. It should be noted that in these benchmark networks for /i > 0.4 the communities 
are not well defined any more, according to the planted ^-partition model [20, 21]. 

3.1.2 Real networks 

In addition we applied our method to the real networks of Zachary's karate club (FIG. 6) [22] and 
dolphin social network (FIG. 7) [23]. There is no preference among the four borderlines in karate 
club network. However, in dolphin social network, the weighted average (WA) seems to work more 
properly, while the usual average (AA) has the lowest distinction power. 
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Figure 5: Variation of normalized mutual information against mixing parameter for networks with 
2 communities with average degrees 25 and 15. The other network parameters are as follows: 
= 100, 7 = 3, Kmax = 50, Smin = 30, and S^ax = 60. 
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Figure 6: (a) Karate Club network, (b) Projection space. All four definitions for yield the same 
borderline and detect the two communities correctly 
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Figure 7: (a) Dolphin social network, (b) Projection space. WA has the best result with only one 
misidentification 

3.2 Networks with more communities 
3.2.1 Test on the benchmark networks 

We used LFR benchmarks [14] to examine the performance of the algorithm on networks with 
different parameters. FIG. 8a illustrates the performance of algorithm on networks with the average 
degree Kave = 20, the power-law exponent of degree distribution exponent 7 = 2 and community 
size distribution /3 = 1. Networks with 1000 and 5000 nodes are considered. In addition, two 
ranges are considered for minimum and maximum community sizes: averagely small community 
sizes {Smin = 10, Smax = 50) and averagely big community sizes {Smin = 20, Smax = 100) which 
are indicated by signs S and B respectively. 

A better performance is observed for larger networks (A^ = 5000) according to normalized 
mutual information I{A, B). In case of smaller communities and larger networks {A^ = 5000, 5*}, 
/(A, B) slightly increases with yU up to /i < 0.5,. This is because of the fact that size resolution effect 
(resulted from large number of very small communities with average degree fewer than Kave = 20) 
decreases when mixing parameter grows. For /i > 0.6, the high mixing between communities 
dominantly makes it harder to detect communities, causes the decrease in I{A,B). 

In FIG. 8b, the size resolution effect is investigated for networks with 1000 nodes and different 
average degrees (/i = 0.5, 7 = 2, /3 = 1) by considering Smin = 20 and plotting I{A,B) against 
Smax- The results are depicted for different choices of projection space (U or F). 

3.3 Improving the method 

The hierarchical clustering algorithm constructs the communities according to the proximity of 
points in the projection space. In each step of this algorithm the two closest clusters of points are 
merged together, where the closeness of each pair of clusters is defined according to a similarity 
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Figure 8: (a) Variation of normalized mutual information against mixing parameter in networks 
with 1000 and 5000 nodes, resulted from using (U) as projection space, (b) The effect of community 
size heterogeneity on I{A, B) in networks with 1000 nodes. The other network parameters are: 
7 = 2, /3 = 1, Kmax = 50, and in (a) S stands for {Smin = 10, Smax = 50) and B for (S"™™ = 



20, Sr, 



100). 



measure ^. We are interested in a merging procedure in which there is a priority for joining clusters 
that are also closer to each other in the real network. In their spectral algorithm, Donetti and 
Muiioz [7] take this into account by inserting a constraint that two clusters are merged only if there 
is a link between them in the network. Implementing this constraint will improve the performance 
of our algorithm as well. However our observations show that a considerably better performance 
is achieved by simply altering the initial similarity between pair of points as follows: 

Sim{i,j) = Sim{i,j) * d^^. (9) 

Here Sim{i,j) is the similarity between points i and j and dij the length of the shortest path 
between nodes i and j. This change does not affect the main body and speed of the algorithm. 



3.3.1 Performance of the method on LFR benchmarks 

In FIG. 9 the performance of the algorithm achieved by this improvement is illustrated. The 
network parameters are the same as FIG. 8. Now the method has better performance and the 
negative effect of heterogeneity of the community sizes is appreciably decreased. This is because 
even if some clusters are close to each other in the origin of projection space, the nodes in this 
region have less interactions with the rest of the network, especially with nodes other than their 
own community, hence they are more distant from nodes of other clusters. Accordingly, these 
clusters are not mixed any more with the new definition of similarity. In addition, our method 
still works better for lager networks of 5000 nodes. In the inset of FIG. 9 the performance of 
the algorithm by using the Laplacian matrix instead of dumpiness matrix is illustrated. In small 

^In the group-average clustering method we used, the similarity of two clusters equals the average of distances 
between all pairs of points in the two clusters. 
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Figure 9: Performance of our algorithm against mixing parameter in networks with 1000 and 5000 
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mixing parameters both matrices work well, however in large mixing parameters dumpiness matrix 
has a better performance. 

The networks constructed for FIG. 9 have the same parameters as the LFR networks used in 
the second figure of Ref. [21] to compare the performance of several methods. Hence, a comparison 
between our method and some other methods with high performances is possible. In comparison 
with the algorithm of Donetti and Munoz (DM) [7], our algorithm has an appreciably better per- 
formance for /i > 0.6. For fi < 0.6 our method and DM has nearly the same performance except for 
the {A^ = 5000, S} networks in which DM shows a considerable decreased performance. Compared 
to the inset of FIG. 9, the DM algorithm should work better in this region. This difference can 
be due to the error in finding the number of communities and the differences in employing hierar- 
chical clustering algorithm. Additionally it is likely that a local maxima of modularity is achieved 
in the realizations of the DM method, as in the case of {A^ = 5000, S} networks the number of 
communities is much more, one should consider a larger maximum number of communities (D) in 
the DM method to obtain the global maxima of modularity. 

In Ref. [21] the infomap method [24] was shown to have a great performance with I {A, B) = 1 
up to /i = 0.6 for all networks, however it yields I{A, S) ~ for {A^ = 1000, B, jj > 0.65}, {A^ = 
1000, S, n> 0.75}, and {A^ = 5000, 5, /i ^ 0.8}, and I{A, B) < 0.4 for {A^ = 5000, ^, /i ~ 0.8}. 
In comparison, our method yields values of I{A,B) close to 1 (the least {I{A,B)) ^ 0.95) up to 
/i = 0.5 and in large values of mixing parameter when infomap yields very small values of I{A, B) 
our method has an appreciably better performance. 
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Figure 10: (a) A graph constructed from interactions between proteins. This network is a sub- 
graph of S.cerevisiae protein-protein interaction network [26] and shows the three communities 
assigned to the protein Zdsl (node number 8 showed by a yellow square), (b) corresponding 2-D 
projection space. The dashed lines in (a) indicate the three detected communities which are in 
accordance with real data. 

3.3.2 Real- world examples 

In this part the method is used to illustrate the communities in several real networks which have 
3 or more communities. 

Protein-protein interaction network 

As our first example, we considered communities in a graph constructed from interactions between 
proteins (FIG. 10a) [25]. This biological network is comprised of three communities and members 
of each community either have a certain function or correspond to a protein complex [25]. The T 
projection space of this network is 2-dimensional (FIG. 10b). The accumulation of points in three 
groups can be observed in FIG. 10b, which is in accordance with real communities. 

Citation network 

The network of journal citation reports [27, 28] is an example of the case which we can have 
a good guess at the number of communities. This informational network is constructed of 40 
journals in four fields of physics (circles), chemistry (yellow squares), biology (triangles), and 
ecology (dark blue squares). In FIG. 11 we showed the partitioning resulted from the infomap 
method [24] and our method by guessing the number of communities to be 3 or 4. The infomap 
code separates the network into 3 communities, however by taking into account the weights of the 
links it detects 4 communities correctly, apart from an excess group of 2 nodes. Our algorithm 
detects the communities precisely according to the given number of them. 
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Figure 11: Community structure of a citation network. The solid and dashed hues indicate the 
partitions resulted from our algorithm given the number of communities to be 3 or 4 respectively. 
The solid line also shows the partitions found by infomap method. The additional dashed and 
dotted lines indicate the result of infomap method on the weighted version of the network. 

American collage football network 

Another example is a social network of American collage football teams (FIG. 12) [29]. In this 
network the links show the games between 115 teams in year 2000. The teams were grouped 
in 12 conferences and most of the games were held between the teams of the same conferences. 
FIG. 12a shows the result of our algorithm when the network is assumed to have 12 communities; 
the modularity [10] for this partitioning is Q = 0.6005 and by assuming each conference as a real 
community the algorithm gives a normalized mutual information equal to I{A, B) = 0.9242. The 
infomap method identifies 12 communities in the network and gives the same value for I{A,B). 
However the maximum modularity is obtained for a partitioning with 10 communities {Q = 0.6046) 
that yields a normalized mutual information equal to I{A, B) = 0.9522. 

Word association network 

Our final example is a graph derived from the word-association network of University of South 
Florida Free Association Norms [30]. The links between words in this network are constructed 
using an experiment. In the experiment, cue words were given to 6000 participants and they were 
asked to introduce the first word that came to their mind [target word) after seeing each cue word. 
There is a link between two words if they have a cue-target or target-cue relation. 

The nodes in our graph (FIG. 13) are the neighbors of the word bright and there is a link 
between two words if one of them is mentioned at least 3 times as the target word. There are close 
relation between theses words (i.e. semantic, syntactic, etc), however the graph can be partitioned 
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(a) 



(b) 



Figure 12: The American collage football network [29]. The teams of each conference are shown 
with distinct colors and gathered in circles. The square boxes show the partitions which were 
different from conferences. Other communities are identical with conferences, (a) Communities 
detected by considering 12 communities for the network, (b) Partitioning the network into 10 
communities yields the maximum modularity oi Q = 0.6046. 

to obtain groups of words which are practically more similar and have closer function. FIG. 13 
shows a good partitioning of the graph into 6 communities with modularity Q = 0.4479 resulted 
from our algorithm. 

4 Concluding remarks 

In this paper we introduced a method for detecting communities of networks based on spectral 
properties of dumpiness matrix and the projection space corresponding to eigenvectors of this 
matrix. Our analysis shows that the method gives accurate results for many computer-generated 
and real-world networks. In benchmark networks, the method maintain a good performance even 
in large mixing parameters. The accuracy of the method was checked by normalized mutual 
information. The method finds communities in just one step not in a hierarchical manner. We 
also discussed qualitatively the effectiveness of the method by looking at the dumpiness matrix 
as the Hamiltonian of the system. The effect of size heterogeneity is also discussed and tested on 
some benchmarks. It is also observed that the algorithm gives better results for larger networks. 

It is observed that the method can be appreciably improved by employing a minor change 
in the hierarchical clustering method. The improvement of algorithm in other aspects is also 
possible. For example one can try to build a graph matrix whose eigenvectors can better highlight 
the communities. This can be done by substituting another centrality measure (e.g. betweenness, 
closeness, random walk centrality [1]) instead of nodes' degree, which is more related to the role of 
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Figure 13: A word graph constructed from the relations between neighbors of the word bright in a 
free association network. The nodes of each community, detected by our method, are plotted close 
together and have the same color. There are close relations between words of the graph, however 
words of each community are practically more similar. 

nodes in communities. Additionally other suitable distance measures can be used (e.g. one based 
on random walks introduced in [31]). One can also change the exponent of the dij in the elements 
of the dumpiness matrix and assign an arbitrary value a to it. By the current distance definition 
our observations, show that the effect of varying a between 0.5 and 3.5 on I{A, B) is not greater 
than A(/(A,5)) = 0.02. 

It is worth mentioning that the extension of our method to the case of weighted networks is also 
possible. To this end one should replace the degree of a node with the sum of its links' weights. 
The shortest path length between two nodes can be defined in various ways depending on the 
structure of the network and meaning of weights. This is the subject of further investigation. 
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